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The "cloze procedure" (CP) was defined by Taylor (1953) as- a 
technique In which a message Is mutilated by deleting every nth ele- 
ment. The task requires reconstruction of the message In Its orig- 
inal form. Taylor postulated that the "cloze unit" Is dependent on 
total context and the degree of association with the Immediately 
surrounding elements. 

A review of the literature has revealed a lack of evidence re- 
garding the functioning of mentally-retarded children on the cloze 
task (Jordan, 1963; McCarthy, 1964). The present review of research 
explores the variables which affect cloze performance. Our goal Is 
to apply CP In a research program designed to study the language 
functioning of mentally retarded children. 



The Cloze Procedure as a Measure of Readability 

Taylor (1953), whose Interest In CP concerned specifically Its value as a 
test of readability, has hypothesized that words deleted from difficult passages 
are more difficult to replace than words removed from easier passages. He con- 
tended that CP measures the similarity between the patterns that the decoder Is 
anticipating and the patterns that the encoder used. 

Initially, Taylor compared the amount of agreement between cloze scores 
and two common measures of readability (Flesch and Dale-Chall) . He was also 
Interested In the effect of different patterns of word deletion (e.g., random 
deletion vs. eliminating every |vth word; removing "few" vs. "many" words). 

Cloze scores yielded rankings of passages on relative difficulty which were 
similar to those obtained from the Flesch and Dale-Chall formul^'.s. Random 
deletion patterns and every nth systems resulted In comparable ratings If more 
than 16 blanks per passage were supplied. No appreciable Increase In efficiency 
of differentiation was obtained by giving partial credit for relevant synonyms 
substituted for deleted words. 

Taylor also tested the power of CP In differentiating passages of 
prose which differed, by a priori judgment. In reading difficulty. His hypoth- 
eses were that the readability of eight previously-rated passages would depend- 
ably predict the rank order of the cloze scores; that CP would successfully 
differentiate the eight passages, whereas the standard (Flesch and Dale-Chall) 
formulas would not; and that the difference between cloze scores for the various 
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passages would be significant. The internal consistency of the cloze scores 
was also measured by dividing the ^s for each passage into subgroups so that 
performance of those in the upper third (as judged by total scores) could be 
compared with the performances of those in the lower third. 

The results verified the discriminating power of CP. Ranking of read- 
ability by six judges for the eight passages yielded a reliable prediction of 
the order of difficulty obtained’ from the cloze scores. These scores came 
closer than either of the standard formulas in ranking the relative readability 
levels of the passages. Analysis of variance indicated significant differences 
in the scores for the eight passages (p < .01). Internal consistency coeffi- 
cients were high enough to indicate that half as many blanks could have been 
used without changing the ranking of passages. In every case, the cumulative 
score of the high scorers quickly diverged from that of the low scorers — 
usually within the first five blanks. Taylor suggested that CP might provide 
a method for contrasting the reading abilities of different individuals, but 
that the potential usefulness of the technique is in no way confined to the 
study of reading. 

In another study Taylor (1954) used a 175-word passage from which every 
fifth word was deleted. Five different cloze versions were constructed by 
starting with a different word so that all words were deleted in one of the 
five cloze tests. Each of the cloze versions was given to a group of 50 ^s. 
Cloze and entropy (degree of uncertainty) scores were determined for each of 
the overlapping forms. The rank-difference correlation of -.87 between cloze 
and entropy scores was interpreted to mean that blanks having high uncertainty 
yield significantly lower cloze scores. 

The Cloze Procedure as a Measure of Reading Comprehension 

Taylor (1956) reported a study designed to measure how well S.s understood 
the meaning of a passage. It was hypothesized that, as understanding increased, 
the probability of providing the correct word increases and, consequently, the 
probability of restoring the* passage to its original form is enhanced. Taylor 
also contended that cloze scores’ would predict intelligence, existing knowledge, 
and success in learning* and remembering. Air Force trainees were used as ^s. 
Comprehension tests for an Air Force technical article were administered before 
and after completion of the article. Cloze tests were constructed by using a 



Bennett 



3 



< 



I 



sample representing 20 per ■ cent of the article. Ten words were "mechanically" 
deleted from each nine llnesi The Ss were first given the cloze test and then 
the "before" comprehension test'. A week later they read the technical article 
which was immediately followed- by administration of the 'Sfter" comprehension 
test, which In tarn was- followed- by another form of the c*oze teat. Correlations 
between cloze scores, comprehension- measures and Intelligence test scores were 
computed. Both "before"' and "after" cloze scores correlated highly with the 
comprehension scores. The Intelligence measure and the "before" cloze test 
scores correlated .73. The "after" cloze and Intelligence test scores correla- 
ted .74. The cloze scores predicted "learning" after study as adequately as 
comprehension test scores s 

Three deletion- patterns were used in another study reported by Taylor 
(1957): (1) deletion of anyword; (2) deletion of "hard" words (nouns, verbs, 
adverbs); and (3) deletion- of "easy" words (verb auxiliaries, conjunctions, 
pronouns, and articles). The "ease" in filling in these words was determined 
by the author from previous studies. The results indicated that in general 
the "any" cloze form yielded more’ stable, reliable and discriminating results 
than did the "easy" or "hard" forms*. The "any" deletion pattern yielded 
correlations with the comprehension- test of between .70 and .88; in all but 
one case these coefficients were larger than those for the hard or easy 
deletion patterns. The findings from this experiment confirm cloze readabil- 
ity scores as valid measures' of ‘ the' comprehension of English prose. 

The Cloze Procedure as a Measure of Reading Proficiency 

The initial studies by Taylor- on the relationship between cloze perfor- 
mance and readability were quickly followed by a series of studies relating 
cloze scores to individual reading proficiency. Notable among these were 
studies reported by Jenkinson (1957) , Rankin (1958), Fletcher (1959), Bormuth 

(1962), and , more recently, Greene (1964). 

Jenkinson (1957) studied CP responses of high school students to three 
deletion conditions: (1) key words (inferred from the surrounding context); 

(2) words from three form classes; and (3) 90 per cent of the high-frequency 
words. Jenkinson found correlations between total cloze scores and the Coopera- 
tive Reading Test of .78 and .73. Significant differences in cloze scores 
across grade levels were paralleled by reading test scores. 
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Rankin (1958) employed CP to assess gains in reading skill after a read- 
ing improvement course. Five cloze tests were developed. In one form, every 
fifth word was deleted. The other four forms deleted nouns and verbs alter- 
nately. Cloze scores were found to be significantly related to intelligence. 

^s who received higher intelligence scores had higher pre- and post-test read- 
ing scores. However, the amount of gain was not related to IQ scores. Reading 
rate was not related significantly to post-reading cloze scores for the passages 
in which nouns and verbs had been deleted. 

Fletcher (1959) related cloze scores to reading comprehension, reading 
rate and general verbal ability. Every fifth word was deleted, irrespective 
of form class. He found correlations of .59 and .57 between the cloze scores 
and two tests of reading rate. On the comprehension test, the correlation 
with the cloze score was .55. A test of general verbal ability (ACE) correlated 
.72 with the CP scores. Fletcher also found that content words were more diffi- 
cult to supply than function words. High-frequency words were the least diffi- 
cult to supply and had little tendency to occur among the items that discriminated 
best. 

Fletcher observed that the CP responses were similar to those frequently 
observed in controlled association studies, (l.e.j a small number of different 
responses were given to each item with one word given a large proportion of 
the time) . Low scorers left more items blank and tended to s^ve uncommon re- 
sponses than did high scorers. 

Three aspects of CP were investigated by Bormuth (1962) : (1) validity as 
a measure of comprehension ability; (2) efficiency in discriminating between 
difficulties of tests and abilities of individuals; and (3) validity as a 
measure of the amount of comprehension with which passages are read. 

Comprehension tests were developed for the nine passages used. Cloze 
tests for each passage were constructed by deleting every fifth word. Only 
exact replacement of the deleted word was scored as correct . Children from 
grades 4, 5, 6 were the ^s. 

The results support the- hypothesis that cloze tests are related to mea- 
sures of comprehension. All correlations between comprehension and cloze 
scores were significant. Item reliability for the cloze test was sufficient 
to Insure efficiency in discriminating both tests and individuals. 
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Greene (1964) conducted* a- study in* which both the standard cloze technique 
(deleting every ^th word) and a modified method were used. In the modified 
procedure, the only words eligible* for deletion were nouns, verbs, adjectives, 
and adverbs. In this latter* condition', each deleted word was evaluated for 
its value as a test item by examining the redundancy remaining in the passage 
and selecting those items which it was felt could be replaced by a superior 

reader . 

Greene was specifically interested in the relative efficiency of the two 
deletion procedures in predicting scores on standardized reading achievement 
tests. No significant differences' were found between the two procedures as 
measured by their correlations' with reading tests: both cloze techniques corre- 
lated highly with the reading tests. The modified cloze test was significantly 
more reliable than the standard' CP; Contrary to his prediction, Greene found 
reading speed to be significantly related to cloze test performance. Two voca- 
bulary tests accounted for approximately one-third of the total cloze-test 
variance. Thus, after taking into' consideration the variables of vocabulary 
and rate of reading, a large proportion of the cloze test variance remained 
unaccounted for. 

It has already been noted that Taylor (1956), Jenkinson (1957) and Rankin 
(1958) found that intelligence was an important variable in cloze performance. 
Ruddell (1963) constructed three cloze tests with sentences of high frequency 
and three with sentences of low frequency in oral speech. No significant 
differences were found in the* performance of fourth-graders on the high and 
low frequency tests. However, the' following factors did contribute to per- 
formance on both kinds of cloze' tests: (1) father's occupation; (2) mother's 
educational background; (3) intelligence; (4) mental age; and (5) chronological 

age. 

Weaver (1961) presented cloze passages aurally or by having read and 
complete the passages. Cloze tests were constructed with differing criteria 
for deletion: (1) every nth word; and (2) deletion of content words (nouns, 
verbs). He found that the "any-word" pattern of deletion was easier to solve 
when read by the S, but that the content-word pattern of deletion yielded simi- 
lar results in both the listening and reading conditions. Weaver suggests 
that a structural deletion pattern (every nth word) is more difficult when S 
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listens (instead of reading) because the information sequence and mode of pre- 
sentation are under the control of the speaker rather than the 

Effect of Amount and Type of Context on Cloze Performance 

Next to be considered is a series of studies dealing with the effect of 
the amount and type of context- on' cloze proficiency. Although it did not in- 
volve deletion of words, a study by Burton and Licklider (1955) is relevant 
to the discussion. These authors studied the extent to which estimates of the 
redundancy of English texts are dependent upon the number of preceding letters 
known to the They found that, although relative redundancy increased as 
knowledge of the foregoing' text approached 32 letters, providing more than 32 
letters did not result in a noticeable increase in performance. 

Miller and Friedman (1957) eliminated or substituted letters in printed 
English texts, added extra letters, etc. They found that "average" £s were 
unable to correct such passages if more than 10 per cent of the letters were 
altered. Random substitution of letters was clearly the most difficult condi- 
tion. Superior _Ss with unlimited time were able to correct passages with as 
much as 50 per cent of the text "mutilated". 

Using complete prose paragraphs, MacGini tie (1961) studied the effects 
of different patterns of omission on the difficulty of restoring periodically 
omitted words. He found that words were equally restorable when every 24th, 
12th or 6th word was omitted but, when every third word was omitted, restora- 
tion was considerably more difficult. Words omitted in groups of four were 
more difficult to replace than words omitted in pairs, and pairs were more 
difficult than single words. MacGinitie concluded that context more than 
five words distant from the omitted word has relatively little effect upon 
cloze test performance. 

The results of the MacGinitie study appear to be related to the Burton 
and Licklider study in that both indicated a basic limitation in the length 
of units which can be restored from context. The redundancy of our language 
is primarily effective for small segments of a message. That is, if omitted 
segments are small they can be restored with great accuracy. If groups of 
words or letters are omitted, the loss is usually irretrievable from context, 
even though such groups may be few and widely separated. 
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Shepard (1963) required Ss to* produce a list of words which could be used 
to fill a blank representing- a word deleted from a passage of text. The amount 
of context that originally surrounded the deleted word was varied. The hypo- 
thesis tested was that as the- amount’ of context was increased, the number of 
alternative words- that could fif in- the- blank decreased. The amount of con- 
text used was 1,2,4,6,10, or 40 words (as they occurred in original passages 
in the New York Times) . - Ss were given five minutes to list as many words as 
possible that might be- placed- in* the- missing blank. Shepard found that the 
average rate of responding- fell from- about 20 words per minute, when the small- 
est amount of context was provided-, to two words per minute when the maximum 
amount of context was- given-. • Shepard- stated that, since rate of producing 
words may be a function of uncertainty associated with a category, uncertainty 
might be inferred from rate. 



Effect' of - Form- Class on Cloze Proficiency 
Another series of studies* has been devoted to testing the effects of form 
clasa.on the relative ease in filling in cloze passages. Both Jenkinson (1957) 
and Fletcher (1959) found that* content words were more difficult to replace 



than function words. 

Fillenbaum, Jones, and Rapoport (1963) investigated the effect of rate 
of deletion upon predictability* of form class replacement in a cloze task. A 
sample of continuous speech elicited by TAT cards, was used with deletion of 
every second, third, fourth, fifth, or sixth word. Each deleted word and each 
word supplied by the S was classified into one of eleven grammatical categories. 

The results showed that, with every second word deleted, the form class of 
the missing word was correctly identified in nearly two-thirds of the cases. 



The largest increase in predictability occurred between deletion rates .f every 
two and every three words. When the data were scored for exact agreement be- 
tween the words of the original passage and responses of the Ss (verbatim re- 
sponses) , adjectives were found to have been most difficult to replace. In 
form class identification, Ss had* most difficulty with adverb and quantifier 
slots. Nouns were easiest to replace, followed by prepositions, then pronouns, 
auxiliary verbs, adjectives, and articles. In general, "syntactic words" (pre- 
positions, conjunctions, articles, auxiliary verbs) were more easily replaced 
than "semantic words" (nouns, verbs, and adjectives). 
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One hundred college students were tested by Coleman and Blumfeld (1963) 
using nominalized sentences (e.g., "our goal is the achievement of the high- 
est good for society, to be attained through a correct definition of names.") 
and grammatical transformations of these sentences with active verbs (e.g., 

"Our goal is that we achieve the highest good for society and this is attained 
when we define names correct _y."). Every fifth word was eliminated and re- 
placed by a blank 10 spaces long. On the average, 9.63 words per £ were supplied 
correctly in the nominalized sentences and 10.80 in the sentences using active 
verbs. The mean number of blanks correctly filled in for content words was 
1,44 for nominalized sentences, in contrast to 2.22 for active verb sentences; 
a significant difference ( p < .01). Proficiency in filling in function words 
was not significantly different for the two types of sentences. 

An analysis of correct responses by form-class revealed that nouns, verbs, 
adjectives and adverbs were equally difficult to replace, whereas pronouns and 
function words were more easily supplied. Whatever the form class, ^s were able 
to fill in more blanks in sentences with active verbs than in nominalized sen- 
tences. 

The percentages of correct responses by form class are lower than those 
found by Aborn, Rubinstein, and Sterling (1959), who were interested in the 
constraint upon words attributable to the amount, distribution, and structure 
of context in cloze sentences. Sentences of 6, 11, and 25 words each were 
drawn from popular magazines. Each word of every sample was classified accord- 
ing to form class: i.e., noun, verb, adjective, adverb, pronoun, or function 
word. Function words included articles, conjunctions, prepositions, auxiliary 
verbs, interjections, and quantity words. Words were deleted in one of four 
positions: (1) initial position; (2) early medial (the third word in six-word 

sentences, the fourth word in 11-word sentences, and the eighth word in 25- 
word sentences); (3) late medial (fourth word in six-word sentences, eighth 
word in 11-word sentences, and the 17th word in 25-word sentences); and (4) 
final position. Sentences were chosen so that an equal number began with 

words of each of the six different form classes. 

Function words and pronouns were found to be considerably more predictable 
than words of the other form classes. For the most part, the percentage of words 
correctly given was a function of the size of the class, i.e., the smaller the 
frequency of words in that class, the greater the probability of correct predic- 
tion. The descending hierarchy of classes according to size would appear from 
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these data and those of Fries (1952) to be as follows: nouns, verbs, adjec- 
ttoes adverbs, function words, and pronouns. The tendency to draw responses 
from a class other than- that- of the omitted word was most pronounced in the 

case of the adjective and adverb. 

In analyzing for the- position' effect , it was found that words omitted 
in the medial positions were more predictable than words omitted in the 
initial or final positions. Thus, as reported by Miller and Friedman (1957), 
bilateral context exerts greater constraint than unilateral context (regard- 
less of whether the context precedes or follows the missing word). The re- 
sults indicate tnat predictability of the missing word increases with increas- 
ing sentence length, but that this reaches an asymptote, with no noticeable 
increase in predictability for the 2 5 -word sentences over the 11-word sentences. 

It thus appears that constraint' reaches its maximum somewhere between sentence 

lengths of five and ten words. 

Contribution of Syntactical and- Semantic Components to Cloze Performance 

In 1961 MacGinitie stated that: "Although various sources of constraint 

(e.g., topic, syntax, semantic reasonableness) can be distinguished concep- 
tually. it has long been recognized that it is difficult in completion experi- 
ments to separate their effects" (p. 128) . A study by Salzinger. Portnoy, 
and Feldman (1962) attempted to investigate the semantic and syntactic factors 

in verbal behavior using CP. 

Fifty-word passages developed by Miller and Selfridge (1950) at each of 
eight orders of statistical approximation to English (zero, f irst ... seventh . 
text) were used. Either every fifth or every seventh word was deleted by 
Salzinger et al. The results indicated that as the order of approximation 
approached normal English, more deleted words were correctly supplied. Further, 
these authors found, as did MacGinitie (1961) and Aborn. Rubinstein. End Sterling. 
(1959). that ^s do not or cannot make use of more- than five words on either side 
of each blank. By classifying both the incorrect and correct cloze responses, 
it was discovered that, with increasing order of approximation to English. Ss 
gave increasingly more words -of the same form class as the deleted word; how 
ever, the effect did not increase appreciably beyond Order 3. Salzinger et al. 
suggest that improvement in memory for sentences must be attributed: primarily 
to increased syntactical structure between Orders 1 and 2, about equally to 
syntax and meaning between Orders 2 and 3. and primarily to meaning beyond Order 3 
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Miscellaneous Variables in Close Performance 

Manis and Dawes (1961) studied the variable of attitudes toward the "mu- 
tilated" found that passage responses to blanks formed by deletion of content 
words did not reflect differences attributable to previously-expressed agreement 
or disagreement with the passage i while responses to function words were affected 
by prior opinion. 

Fillenbaura and Jones (1962) report a study which attempts to measure, via 
CP, the extent of deviation of aphasic speech from normal speech. Cloze tests 
were constructed by deleting every fifth word from speech transcripts of normal 
and aphasic £s. College students then were given these cloze tests, and the 
number of correctly-replaced words in the normal transcript was compared to that 
in the aphasic transcript. 

Cloze scores were significantly higher for passages from "normal" speech 
than from aphasic speech. Even if a rater’s completion of the cloze blank was 
of the correct form class, the probability that his verbatim prediction was 
correct was consistently less for passages generated by aphasics than for pas- 
sages taken from normals. Two cloze scores, fonr. class predictability and 
verbatim predictability, clearly discriminated between texts; these scores 
appear to have contributed independently to such discrimination. 

Fillenbaura and Jones suggest that the conditional proportion of correct 
verbatim completions given correct 'form— class substitution might serve as an 
especially powerful index of the amount of semantic difficulty of an aphasic 
speaker. Low predictability of form class would suggest syntactic problems. 

The authors stated that it might be possible to examine the predictability of 
different form classes, thereby specifying more- precisely the- locus of change 
or damage in speech. 



Summary ana Discussion 

Since the early reaoability stuaies by Taylor (1933, 1934), much progress 
has been made in oelineating the variables contributing to cloze performance. 
Taylor (1936, 1937) found that cloze scores- correlatea significantly with 
tests of reaaing comprehension ana general intelligence. Jenkinson (1937) , 
Rankin (l93o) , Fletcher (1939) , bormuth (1962) and Greene (19b4) all found 
significant correlations between individual reaaing ability ana cloze perform- 
ance. The effect of intelligence on cloze proficiency was also confirmea. 
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Ruddell (1963) found socio-economic class, mental age and chronological age 
to be significant factors in performance on close passages t 

Another series of studies was concerned with the effects of amount of 
context, form class of words deleted and distribution of the deleted items ■ 
Burton and Licklider (1955), Shepard (1963), and Fillenbaum, Jones and 
Rapoport (1963) , found that the amount of available context affected ability 
to fill in close blanks. Jenkinson (1957), Fletcher (1959), Abom, Rubinstein, 
and Sterling (1959), and Ckjleman and Blumfeld (1963), and Fillenbaum, Jones, 
and Rapoport (1963) , discovered that deleted function words were easier to 
supply than content words. This has been explained on the basis of the num- 
ber of words in the form class, i.e., there are comparatively fewer function 
words than content words, resulting in fewer alternatives and therefore greater 
predictability. Abom, Rubinstein, and Sterling (1959) also found that the 
distribution of the deleted word (deletion in initial, medial or final positions 
of the sentence) affected the ability to correctly supply the missing word. 

It was shown that blanks in the initial or final positions were more diffi- 
cult to fill than those in medial positions. Abom, et al. concluded, as had 
Miller and Friedman (1957), that bilateral context exerts greater constraint 
than does unilateral context. 

Salsinger, Portnoy, and Feldman (1962) attempted to differentiate the 
effects of syntax and semantics in the recall of sentences of different orders 
of approximation to English. Using the Miller and SeLfridge 50-word passages. 
Cloze tests were constructed to evaluate the nature of responding at each of 
the eight orders of approximation to English. It was found that with increas- 
ing order of approximation, ^s moved from a reliance on S 3 ^tactical cues to a 
semantic level of functioning. Manis and Dawes (1961) determined that the 
effect of agreement or disagreement with the passage read affects mostly the 
function words. 

The Fillenbaum and Jones (1962) study presents evidence that aphasic 
speech may be objectively evaluated by means of the cloze technique. This 
study with aphasics highlights the potential of the cloze technique in deal- 
ing with language disorders. It may be possible to apply the cloze procedure 
to the study of verbal development in retarded children. Perhaps it would 
be possible to determine the extent to which retardates have learned the 
’’equivalence'* of words from the same form class. In fact, it might be possible 
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to make a prediction as to the relative proficiency on the close test based 
upon the amount of paradigmatic responding (Semmel, Barrltt, Bennett, and 
Perfettl, 1966) on a free-assoclatlon task. It can be hypothesised that J.s 
giving paradigmatic responses will have In their repertoires the largest 
store of available "equivalent" words substitutable for the missing word. 

This would result In higher close scores when scoring was done on the basis 
of whether the substituted word was of the same form class as the deleted 
words. This procedure promises to yield data concerning the amount of In- 
formation retardates obtain from the grammatical structure of connected 
discourse. 

Another factor of significance may be the extent to which ^ Is capable 
of paradigmatic responding. The pattern of associations between Individual 
words may help to explain cloze responses. Some support for the organizing 
effect of assoclatlonal bonds and the consequent effect on cloze performance 
Is obtained In the Fletcher (1959) study, where It was found that high 
scorers showed more commonality of responding than low scorers. Another 
way of scoring the cloze data would be for exact (verbatim) agreement be- 
tween the word deleted and the word supplied by the S^. Flllenbaum and Jones 
(1962) suggest that this may give some Indication as to the level of semantic 
development . 

One variable that has not been studied Is the effect of different levels 
of complexity upon cloze responses. It would seem appropriate to begin a 
study of this kind at the phrase level, using various types of phrases (e.g., 
noun, verb, and prepositional phrases), noting the relative ease with which 
missing words can be supplied. Then simple sentences could be formulated, 
containing different orderings of the constituent form classes (e.g., "Go 
to the store," "The home Is white,", etc.). Thus, not only would It be possi- 
ble to judge the relative difficulty ^s have In supplying words of certain 
form classes but also the Impact of particular form-class orders. Next, 
cloze performance on phrases and simple, disconnected sentences could be 
contrasted with performance on continuous prose. It might be expected that 
retarded children and children with speech disorders would have progressively 
more difficulty with the cloze task In moving from the phrase level to con- 
tinuous prose, partially because of Increasing constraint due to context. 
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We might also speculate that because of greater familiarity with nouns and 
verbs y retarded ^s would be most adept at filling in close blanks reciulrlng 
these form classes. It seems likely from previous studies that close per- 
formance will be related to both chronological age and mental age, irrespect- 
ive of the level of language sampled. 

The efficiency of the close procedure in measuring comprehension of a 
passage of prose should be evaluated by correlating close scores with differ- 
ent types of comprehension tests (e.g., essay, short answer, multiple-choice). 

It may be that the high correlations found between comprehension tests and 
close tests are an artifact of the type of test given. It would seem on an 
a priori basis that multiple-choice and short-answer tests would correlate 
more highly with the close task than would an essay test. 

Taylor (1953) said this about the close procedure: "...a close score 

appears to be a measure of the aggregate influences of all factor which 
interact to affect the degree of correspondence between the language patterns 
of transmitter and receiver. As such, its potential usefulness is by no 
means confined either to readability or the reading abilities of individuals 

(p. 432). 

Future research with the close procedure should attempt to further differ- 
entiate the contributing "aggregate" influences mentioned by Taylor. The use of 
the close technique with new populations may not only solve Important theoreti- 
cal Issues, but may also prove «o be a valuable index of verbal functioning 
with important predictive characteristics. 
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